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ABSTRACT 

The emergence of macroscopic variables can be effected through coarse 
graining. Despite practical and fundamental benefits conveyed by this partition- 
ing of state space, the apparently subjective nature of the selection of coarse 
grains has been considered problematic. We provide objective selection meth- 
ods, deriving from the existence of relatively slow dynamical time scales. Using 
a framework for nonequilibrium statistical mechanics developed by us, we show 
the emergence of both spatial variables and order parameters. Although signifi- 
cant objective criteria are introduced in the coarse graining, we do not provide a 
unique prescription. Most significantly, the grains, and by implication entropy, 
are only defined modulo a characteristic time scale of observation. 

1. Introduction 

Coarse grains lie at the foundations of statistical physics. They provide a 
technical cure for the famous paradoxes of the field, leading to mathematically 
rigorous justifications of irreversibility [1]. And they make intuitive physical sense. 
A related issue is the profound distinction between work and heat, forms of energy 
that enter macroscopic and microscopic degrees of freedom, respectively. But this 
too is the issue of coarse graining, since one can look on macroscopic degrees of 
freedom as the labels of coarse grains. 

Nevertheless, there is a severe unresolved problem. There is no law of Nature 
that defines the coarse grains. But there is a law of thermodynamics — the second — 
that presupposes them, in particular with respect to the heat/work distinction. The 
trouble with coarse grains though is that they seem subjective and technology de- 
pendent. One observer will smear over 10 -6 m and another, with better equipment, 
10 -8 m. One observer will throw all Carbon into the same grain, another will dis- 
tinguish 12 C from 14 C. The questions that arise from this are: 1) what picks the 
coarse grains, and 2) if there is arbitrariness, does it have physical implications, 
e.g., with respect to the second law? [2] 



In this article we show how an objectively defined dynamics picks coarse grains. 
Mostly these grains turn out to be defined with respect to coordinate space, a result 
that many people who do coarse graining, whether for quantum or classical systems, 
take as natural (for example, see [3] or [4]). But as will be demonstrated, by looking 
at the dynamics we will find that space is not the only criterion for coarse graining. 
Under appropriate circumstances, one also gets an order parameter, magnetization 
in the example we consider. This order parameter is recognized as an emergent 
quantity, and it is gratifying that indeed it emerges as a macroscopic degree of 
freedom from the many microscopic variables. Although we will work entirely in 
a classical context, the results parallel those seen in quantum mechanics, where 
grains should also satisfy a decoherence condition. Thus Halliwell [5] finds that 
local energy conservation makes local energy density a decohered quantity. 

A related observation is that at many levels, from hydrodynamics to electrody- 
namics, one averages over microscopic variables to emerge with quantities such as 
density or polarization. By establishing macroscopic laws for these quantities (e.g., 
the Navier-Stokes equation), one confirms that these are indeed appropriate quan- 
tities. Our question in this paper goes beyond this process. It asks, if we did not 
know from extensive experience what variables to choose, what would pick them? 

Our answer is basically, dynamics. The key to defining coarse grains is the 
existence of varying time scales. States are thrown into the same coarse grain if 
they are close to each other dynamically, meaning the system moves from one state 
to the other in a relatively short time. Note that in general our states will live 
in a large space, not necessarily related to ordinary configuration space (they are 
to be thought of as the microscopic states of a large system). It will turn out, 
however, that position does emerge from the coarse graining, essentially because 
particle-particle interactions are often local in space. We will argue nevertheless 
that temporal considerations are the essential criteria, not spatial. Part of this 
argument involves recalling the relation between coarse grains and irreversibility, 
for example following the exposition in Appendix A of [6] (cf. §7 below). 

We emphasize that the physics behind our idea is not new although it hardly 
could be said to enjoy universal consensus, nor has it been implemented, to our 
knowledge, in any systematic way. Our main goal is to show that the idea can be 
implemented within the context of our general method [7-10] for treating nonequi- 
librium statistical mechanics. Here is what Landau and Lifshitz said about macro- 
scopic variables [11]: 

. . . Owing to the comparative slowness of chemical reactions, equilibrium 
as regards the motion of the molecules will be reached . . . more rapidly than 
equilibrium as regards . . . the composition of the mixture. This enables 
us to regard the partial equilibria ... as equilibria at a given . . . chemical 
composition. 

The existence of partial equilibria leads to the concept of macroscopic 
states of a system. Whereas a mechanical microscopic description . . . spec- 
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ifies the coordinates and momenta of every particle, ... a macroscopic de- 
scription is one which specifies the mean values of the physical quantities 
determining a particular partial equilibrium, . . . 

As we will demonstrate, our nonequilibrium formalism provides a framework 
within which to translate this powerful physical idea into a specific coarse graining, 
hence a specific concept of "macroscopic." 

In Sec. 2 we introduce notation. Following that we develop notions of distance 
that arise from the dynamics alone. In Sections 4, 5 and 6 we give examples that 
examine these distances in a number of systems: Brownian motion (§4), the Ising 
model (§5) and heat flow (§6). In Sec. 7 we discuss these results. 

2. Notation and framework 

As developed in [7-10], we consider a system whose microscopic states take 
values in a state space Q of cardinality N < oo. The system's motion is a Markov 
process, X(t), in O with transition matrix R. (N.B., some authors define this matrix 
as the transpose of our R.) Thus 

R xy = " Pr(x <- yf = Pr(X(t + 1) = x | X(t) = y) . 

We assume R to be irreducible, so that it has a unique eigenvalue 1 with a strictly 
positive eigenvector po: 

^2 R xy = 1 and ^ R xyPo(y) = Po(x) > Vx . 

The eigenvalues ("A") of R are ordered by decreasing modulus (and increasing phase 
when applicable). Thus Ao = 1 > |Ai| > | A2 1 > — The corresponding right and 
left eigenvectors are respectively p k and Ak, and satisfy 

Rpk = AfcPfc , A k R = X k A k , k = 0, 1, . . . . (1) 

Although R may not be diagonalizable (and may require a Jordan form), we will 
assume that for the eigenvalues that concern us (those near 1) each eigenvalue 
possesses one or more eigenvectors. They are taken to be real. The orthonormality, 
(Ak\pi) = 5 k e, still leaves a single multiplicative factor for each pair (A kl pk)- The 
stationary state po is naturally normalized by YlPo( x ) = 1; which fixes Aq(x) = 1 
for all x. When detailed balance holds (which we do not assume in general), left 
and right eigenvectors are related by 

A k (x) = p k (x)/p (x) . 

Consistent with this, we demand, for any R, that the As be normalized by 

5>o0r) (A k (x)) 2 = l. 

X 
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2. 1 Coarse graining formalities 

Coarse graining is accomplished by lumping microstates into a macrostate. The 
coarse grains are (disjoint) subsets of Cl (covering all of Cl), labeled x and taking val- 
ues in a space Cl. Each x £ Cl can be written x = (x, u), with u a microscopic label, 
identifying the particular microstate of x that x is. The coarse grained stationary 
distribution is po(x) = ^2 u€ xPo(x, u). The coarse grained transition matrix, R, 
governs transitions in Cl and is given by 

s = p po(y,«i) 

n xy — - fl (£,u 2 )(j7,ui)' 



Po(y) 



for x 7^ y; each R(x, x) is adjusted so that columns sum to one. With this definition, 
po is the eigenvector of R with eigenvalue one, currents add [12] and other good 
properties are maintained. One can also rescale time. 

In this paper we will not be concerned with the coarse graining itself, concen- 
trating instead on how, in principle, one selects the coarse grains. 



3. Distance functions on Cl 

3. 1 Left eigenvectors and dynamical proximity 

In [9] we found a basic identity that allowed us to use eigenvalue near-degeneracy 
to arrive at near-reducibility of the transition matrix, interpreted as the breaking 
of a system into two or more phases. Here we use essentially the same identity in a 
more general way, with "near-degeneracy" replaced by the existence of a sequence 
of time-scales reflected in a clustering of eigenvalues near 1 [13]. 

For t > an integer, we call R l the t th power of R. From Eq. (1), \\Ak{y) = 
J2 X Ak(x)(R t ) xy . The quantity (-R*) xy is the distribution function for a system that 
at time zero was definitely in the state y. Thus we write p y (x, t) = (i?*) XJ/ . 

Suppose that Ai, A2, . . . , A p are close to 1 in magnitude. In contrast to [9] we 
do not require subsequent eigenvalues to be significantly smaller. Consider the iV 
p-vectors V(x) = (Ai(x), A 2 (x), . . . , A p (x)) in R p . Call the set of these vectors A 
and their convex envelope A. Since J2Po( x )Ak(x) = for all k, the p- vector is 
in A. Note that A is of codimension in R p [14]. In [9] we showed that there 
exist p + 1 points, zi, Z2, ■ ■ ■ , z p +±, such that (Ai(ze), A^izg)., A p (zg)) are the 
extremal points of the convex envelope (which contains 0) of all iV p-vectors. Call 
these extremal vectors Vi = V(zi), £ = 1, . . . ,p+ 1. From the eigenvector property 
of the As, it follows that for any V(x), x £ Cl 

(Ai^i(x), x 2 a 2 (x), . . . , \lA p {x)) = J2p*(y, t){Mv), My), My)) ( 2 ) 

yen 
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We write the left hand side of Eq. (2) as V(x)A t , with A the diagonal (pxp) matrix 
of eigenvalues. Recalling that J2 y p x (y,t) = 1, it is immediate that 

V e (l - A*) = 5>, £ (y,f) (Ve - V{y)) (3) 

y 

In [9] we showed that there exist linear forms hij, with coefficients < 1, such that 

hij (V e - V(y)) > for £ = 1, . . . , (p + 1) and j = 1, . . . ,p , (4) 

with equality only if y = zg. Eq. (4), coupled with an assumption on A p +i, was the 
basis for our conclusions on the existence of p + 1 phases. 

Our present use of Eq. (4) is to argue for the near equality of the values of 
the left eigenvectors, A, when points are dynamically close. This will motivate 
our definition of a distance function on O. Consider the left hand side of Eq. (3). 
Because Ai, . . . , A p are by assumption close to 1, this will be small. This requires 
the right hand side to be small. For given I we apply the p linear forms hgj, 
j = 1, . . . ,p to the equation. The sum now involves only positive terms. For this 
to be small (as demanded by the left-hand side of the equation) one or the other 
term in the product must be small, for every term in the sum. If p Ze (y, t) is small, 
the implication is that in time t very little probability flows from zg to y. However, 
when p Zf (y, t) is not small, all p positive linear forms must be small. This will be 
satisfied if the individual values \Ak(ze) — A^(y) \ are small. 

So motivated, we define a distance, d(x,y), as the square root of 

There is both relevant and irrelevant arbitrariness in this definition. For example, 
use of the square of the As rather than perhaps the fourth power probably does not 
much affect the grains. Of more significance is the parameter "£" in the exponent 
of Afc. This will be the time scale for the coarse graining. Through this explicit 
insertion we acknowledge that although in this article we do propose a well defined, 
objective coarse graining process, it is nevertheless dependent on a time scale. An 
alternative (but physically similar) way to insert a time scale would be to truncate 
the sum in Eq. (5) to (e.g.) k such that |Afc|* > 1 — 5 for some 5. 

3.2 Transition matrix based measures of dynamical proximity 

In §3.1 our desire was to use left eigenvectors, which we consider closely related 
to measurable quantities, to define dynamical proximity. We made use of the fact 
that left eigenvectors with eigenvalues near 1 take similar values (at u x" and "y") 
when the quantity p y (x,t) = (R^xy was "large." Now we go directly to p y (x,t). 
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As before, we have qualitative leeway, both in the functional form and in the 
choice of the time-scale. We take as a distance the square root of 



D t (x, y) = — log max 



« 




for x 7^ y . 



(6) 



The quantity e is small and positive and provides a maximum distance, since in- 
dividual matrix elements of R l may vanish (for finite t). The value of Dj(x,x) is 
irrelevant for our coarse graining considerations and can be taken to be zero. This 
"distance" may not be symmetric. 

3.3 Other definitions 

Other definitions are also possible. For example, let R xy = max (R xy , R yx ), 
which is not in general a stochastic matrix. Let 7 be a path of n steps from y to x, 
without repetitions and ^y(£) its £ th state. Then one can consider 



Using either of these quantities, a distance can be defined by distance = — log(r). 

Another measure of distance between states commonly used in studies of stochas- 
tic processes is the first hitting time. This measure (or rather its logarithm) would 
not be additive for tensor products, although it would satisfy an inequality. Like 
some of our measures it is not necessarily symmetric. 

3.4 "Distance" as a metric 

As noted above, what we call "distance" is not necessarily a metric or distance 
in the conventional sense [15]. The function "<i" of Eq. (5) clearly is: the triangle 
inequality follows as it does for any vector product. On the other hand, this metric 
has the disadvantage of not being additive for tensor products of spaces. By con- 
trast, "£)" of Eq. (6) has good product structure, but is neither guaranteed to be 
symmetric, nor need it satisfy the triangle inequality. Other definitions mentioned 
in Sec. 3.3 vary in having or not having some of these properties. 

3.5 Making grains 

As is implicit, we assume that the defining of a distance function provides the 
basis for selection of coarse grains. One could use known algorithms to choose clus- 
ters (which would be the grains) that minimize suitable functions of the internal 
grain distances [16]. Some cluster algorithms select special points to serve as nu- 
clei. The extremal points zt in our left eigenvector construction (Sec. 3.1) certainly 
present reasonable candidates for this distinction. 

For a non-symmetric distance function it would be reasonable to symmetrize 
by using (e.g.) max [D t (x, y), D t {y, x)] or [D t (x, y) + D t (y, x)]. This process may 



ri(x, y) = max 

7 




j 
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induce a loss of associativity: coarse graining in two steps can give different grains 
from those of a single operation (although with a given hierarchy of grains our 
coarse graining process of Sec. 2.1 is associative). Additivity of distance for tensor 
products can also be lost in this symmetrization. 

In the present article we both display distances and calculate clusters. Our 
algorithm is a Monte Carlo scheme, with an annealing protocol, that minimizes the 
sum of the internal distances within each grain. Generally we specify the number of 
grains, but allow grain size to vary (as part of the minimization scheme). A coarse 
graining was accepted when repeated passes with the algorithm did not lower the 
sum of the internal distances. 



4. Brownian motion and diffusion 



4-1 Single particle 

To show the features our definition we first give an example amenable to an- 
alytic treatment. Consider Brownian motion (or diffusion) on a ring, with equal 
probability to step in either direction. The ring consists of A sites, labeled 1, . . . , A, 
with periodic boundary conditions. Define the matrix B as 

/0 1 0... 0\ 
1 0... 
1 0... 

0... 1 
\1 0... 0/ 

A transition matrix for Brownian motion is R = al + (1 — a) i^B + £? T j /2, with 

< a < 1 and T indicating matrix transpose. This matrix is doubly stochastic, 
satisfies detailed balance, and has equal left and right eigenvectors. The eigenvalues 
and eigenvectors are 



B = 



Xk = a + (1 - a) COS0A; 



2ivk 
If 



k = 0,1, 



A 

y 



itfc(x) = A 1/2 cos (x4>k) , v k (x) = A 1/2 sin (xfa) 

"A" 



k 



1,2, 



x 



1,2, 



A 



where [w] indicates the integer part of w. For even A, Vk is zero for k = 0, A/2, 
and for odd A it vanishes for k = 0, in both cases giving the correct number of 
independent eigenvectors. 

For one dimensional motion of a single Brownian particle there is a simplifi- 
cation. The state space coincides with the configuration space, although in most 
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Figure 1. Distance from state #1 for Brownian motion on 50 sites. The figures 
differ in their time scale: from left to right, the values for t (of Eq. (6)) are 10, 100, 
1000. The maximum distance (cf. Eq. (6)) is 46. 



applications the state space is vastly larger. This is two-edged: it makes our job 
easier but may obscure a logical distinction. We later discuss more complex spaces 
built on Brownian motion, and the distinction should then be clear. 

We next look at the distance function using the prescription of Eq. (5) with 
t = 1. With not too much manipulation one obtains 



[N/2] 



sin 2 {{x - y)(/>k/2) 



(1 — a) N ^ sin 2 (0 fc /2) 
For large N this becomes [17] 

1V ' (l-a)nj sm 2 9 (1 - a) 

(x and y are integers). Thus the coarse grains reconstruct configuration space. 

Our second distance definition can also be approached analytically, bearing in 
mind that 

= Jf E [« + (!"«) cos M l sin 2 (i^J^ . 
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However, we instead illustrate the results graphically. In Fig. 1 we show i?* 1 versus 
x for three different values of the time scale, t. Note that all distances are symmetric 
(recalling our periodic boundary conditions) with respect to site #1. As before, and 
in all cases, configuration space is recovered. 

The three cases in Fig. 1 illustrate three characteristic observational situations. 
In the leftmost figure there is very good time resolution (relative to the dynamics of 
the process studied) and sites more than 10 units away from one another will never 
be put in the same grain by a clustering algorithm that rejects grains containing 
"infinitely" separated states. In the middle figure, t = 100, and although all grains 
are at "finite" distance, one can easily distinguish near from far. In the rightmost 
figure we use t = 1000, which leads to small value differences in the distance, ranging 
only from 3.87 to 3.96. In building grains this may be obscured by other factors (if 
this is part of a larger problem), leading perhaps to the inclusion of all these points 
in the same coarse grain. 

4-2 Multiparticle diffusion 

From Brownian motion of a single particle in one dimension, one can extend 
these results to the diffusion of a large number of particles. The underlying configu- 
ration space for all particles and all dimensions is the same as before (the integers, 1 
to N, with periodic boundary conditions), so that the overall state space is a tensor 
product of M copies of the configuration space, where M is the number of particles 
times the number of dimensions. 

For this system the coincidental one-to-one correspondence between state- 
space (O) and configuration space no longer holds. We examine the distance 
"£)" for this case. Microscopic states will be written x = (x±, X2, • • • , im) and 
V = (yi> V2-, ■ ■ ■ ) Dm)- The transition matrix is R xy = Yle^-xeye- It * s immediate 
from Eq. (6) that 



where "D" is either the 1- or M-dimensional distance, depending on its argument. 
Note that for the definition of Eq. (6) the additive property, (7), will be true when- 
ever one builds O as a tensor product. 

As a result of Eq. (7), coarse grains in O can be taken to be compact objects 
(perhaps hypercubes) in R M . The function "£)" is in fact close to being quadratic 
in distance (for points not too far apart). See Fig. 2, where, after subtracting 
an appropriate constant, we take the square root of this distance, a process that 
evidently gives a good approximation to a straight line. 

Using a coarse graining depending on this "D" alone does not lead to a notion 
of density, since the particles are distinguishable. This means that so long as each 
pair X£ and ye are close, the corresponding x and y will be close — despite the fact 
that the various values of (for different £) are not close. Thus the "center of the 
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Figure 2. Using distances from site #1 (for N = 75 and t = 100, using Eq. (6)), 
a constant is subtracted, and the square root taken. The result is plotted and is 
evidently well approximated by a straight line near the point, "1." 



swarm" or other such collective variables do not emerge from this distance function. 
This kind of emergence will be seen in other cases, as described below. 

5. Dynamic Ferromagnet 

The one-dimensional Ising model has the energy function 

N N 

h = —j y~v fc Q- fc+1 - j?y~Vfc , (8) 

k=l k=l 

where is a ±l-valued spin at site #k, J > is the spin-spin coupling constant, 
and B the external magnetic field. Index addition in Eq. (8) is modulo N . Many 
stochastic-dynamical schemes are consistent with this energy and we use the follow- 
ing. A site, k', is randomly selected and the change in energy that would occur if 
that spin were flipped, AE = H(ak' — > — ov) — if (original), is calculated. The flip 
is then implemented with probability a exp(— AE/2T), where T is a temperature 
parameter and a a global constant chosen small enough so that all components of 
the transition matrix are non-negative. A second allowed process is a double flip 
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by opposite pointing neighbor spins. The probability that this occurs is governed 
by an additional controllable parameter. In this model there is no breakdown in 
analyticity, but for T < 1 and B = the system will spend most of the time strongly 
polarized. (The stationary, equilibrium distribution as a function of magnetization 
is strongly peaked at symmetric nonzero values.) Above T = 2 the system is found 
mostly at small magnetization values. 
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Magnetization Magnetization 
Figure 3. Values of the left eigenvectors, Ai and A 2 , as a function of magnetization. 
Temperature is 0.6. Ten spins are used and B = 0. 



The transition matrix, R, in this case is 2 x 2 , but it is sparse, so it is not 
a problem to look at N = 10 or larger on a PC. As expected, for T < 1 there is 
near degeneracy of 1 = Ao and Ai. (As T increases above 1, the gap, 1 — Ai, rapidly 
increases.) 

In Fig. 3 we plot the values of the left eigenfunctions, A\ (whose eigenvalue 
is nearly degenerate with 1 for low temperature) and A2, as a function of magne- 
tization. That is, while there are 2^ states, the magnetization (Y2 a k) only takes 
N + 1 values, and it is these that we place on the abscissa. At this temperature, the 
first few gaps (1-A) are 1.3xl0" 5 , 4.0xl0" 3 , 6.0xl0" 3 (twice), 1.2xl0" 2 (twice), 
1.3xl0 -2 . Thus the distance function "eT will be heavily influenced by A\. It is 
clear from the figure that magnetization is an emergent macroscopic variable. Not 
only are the two extreme magnetizations sharply distinguished, but for differences 
of size 4 there is no overlap in A\ values. If this is combined with information from 
A2 (on the right in Fig. 3), an additional separation can be made between large 
magnetization (of either sign) and small magnetization states. The stationary state 
is in one of two highly polarized states, and while conventional definitions do not 
attribute to this system a transition, by criteria more suitable for finite systems (in 
particular those of [9]) there is such a transition. For reference, in Fig. 4 we plot 
the equilibrium probability as a function of magnetization. 
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Figure 4. Probability distribution as a function of magnetization for the one- 
dimensional Ising model at T = 0.6 and B = 0. 



Even for temperatures such that the system is not highly polarized and is 
mostly found at small values of magnetization, one still finds that magnetization 
is an emergent macroscopic variable. This can be seen in Fig. 5, where the eigen- 
function A\ is shown as well as the probability distribution, both as functions of 
magnetization. The distinction in A\ values among different magnetization states 
is even stronger than at low temperature, but in practice this eigenvector will not 
weigh so heavily in the distance function because the lead eigenvector is far from 
degenerate. Specifically the first few gaps (1 — A) are now 0.015, 0.051, 0.10, 0.15 
(twice), 0.16, 0.19. This is in sharp contrast to the low temperature situation where 
the smallest gap was orders of magnitude smaller than the next one. 

For this example we went a step further and calculated the actual grains. The 
distance (squared) matrix derived from Eq. (5) was fed into an algorithm designed 
to minimize the sum of internal cluster distances. What emerged was a sorting 
according to magnetization, both above and below the transition. In Fig. 6 we 
show the grains that result from this procedure. The magnetization of the states 
within each grain is given in Table 1. 
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Figure 5. Value of the left eigenvector, A\, and probability distribution of the 
stationary state, as a function of magnetization. Temperature is 2.25. Six spins are 
used and B = 0. 



6. Heat flow 

Heat flow represents the diffusion of phonons and we model this in the following 
way. N 2-level systems lie on a line. The two levels at each site will be called up 
and down, excited or unexcited. At one end of the line is a source of heat. The 
stochastic rule is that if the left-hand site is down, there is some probability that 
it will become up, absorbing energy from the hot reservoir. That probability is 
related to the temperature of the source. On the right the opposite can happen. If 
the rightmost site is up, there is some probability that it will drop, modeling the 
passage of that energy to the cold reservoir. At all other sites there is no preferred 
direction. An adjacent pair of sites is randomly selected and if one is up and the 
other down, there is a fixed probability that they will exchange levels of excitation. 
With this rule there is linear temperature dependence from one end to the other 
(where "temperature" means time-averaged excitation). 

Coarse graining of the entire state space tends to put states of the same total 
excitation into the same grains. We provide data here on a more detailed kind of 
coarse graining, related to density. Restrict attention to subspaces with a fixed level 
of excitation. Among these we define a distance using Eq. (5). Not unexpectedly, 
grains tend to contain states with similar spatial distribution of excitation. In Fig. 
7 we show the grains for the case of 10 sites and excitation levels 2 (for which there 
are 45 states) and 3 (with 120). Once again coordinate space emerges as a relevant 
macroscopic quantity. There is also the beginning of the emergence of a macroscopic 
"density." 
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Figure 6. Coarse grains, as computed from the distance function given in Eq. (5), 
using a cluster internal distance minimization algorithm. The system is the one 
dimensional Ising model at temperatures 0.8 (left) and 2.25. The grains are read 
from the figures in the following way. Each row represents a state of the system, 
with blackened regions corresponding to excited sites. Three empty grayish rows 
separate each pair of grains. 



7. Discussion 

Coarse graining based on dynamics has been shown to recover at least some 
aspects of the macroscopic world, in particular it accounts for the emergence of 
some of the less obvious collective variables. The "obvious" variables correspond 
to configuration space and indeed they do emerge in a natural way. But others are 
well defined by virtue of collective behavior, a feature that dynamical criteria can 
select. We have shown that density and magnetization fall into this category. 

But the story is far from complete. We did not argue for a unique choice of 
distance function on the global state space, but rather offered several possibilities. 
Our criteria for choosing were mainly theoretical and aesthetic, additivity over 
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Table 1. Magnetization of the states within a grain. Each row of the left and 
right tables corresponds to a grain and lists the magnetization of each state in that 
grain. Note that within each grain the values of magnetization are almost all equal, 
demonstrating that with the definition (6) the order parameter, magnetization, is 
an emergent quantity. 



tensor products or symmetry. Ideally these differences would be expressed in the 
laboratory, for example by changing definitions of coarse grains, one changes entropy 
and ultimately temperature, through ^ = However, the enormous numbers 
involved in entropy definitions — which has permitted rather loose definitions of 
coarse grains — will probably preclude easy experimental tests. A second kind of 
ambiguity relates to the selection of a time scale. Here too one would hope for 
physical input, although in this case one can easily imagine that as the time scale 
changes, what one (coarser) observer would call a fluctuation another would consider 
deterministic time evolution. 

This last remark emphasizes that the quantitative implementation of coarse 
graining that we have initiated in this paper moves the Second Law of Thermody- 
namics from a law of Nature to a convenient tautology [18]. Another perspective 
on this can be seen by recalling an observation in [6]. In that article the following 
is shown: Coarse graining a distribution function, evolving it forward, and then 
again coarse graining, either increases the entropy or leaves it unchanged. This is 
effectively a "proof of irreversibility, but the demonstration relied on an important 
physical assumption, to wit, by the time the second coarse graining is implemented, 
there must be local equilibrium within each grain. Of course this is the same point 
that is made in [11]. But the implication is that with faster observations there will 
no (or less) entropy increase [19]. 

In the introductory section we gave as an example of coarse grain ambiguity 
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Figure 7. Coarse grains, as computed from the distance function given in Eq. (5), 
using an internal cluster distance minimization algorithm. Two parameters are 
varied: t, the time scale, and M the number of sites excited. The four figures 
correspond respectively to (t, M) = (1, 2), (10, 2), (1, 3), (10, 3). The grains are read 
from the graphs as in the previous figure. Thus for t = 1, M = 2 the first grain 
corresponds to the upper 3 rows of the leftmost figure. All states have two excited 
sites and in this grain, 2 out of the first 3 sites are excited in all cases. 



the ability or inability, as the case may be, to distinguish C from C. How is 
that addressed by our ideas? Consideration of this example forces the realization 
that the "dynamics" that enters our matrix "i?" must include the observer. An 
observer that is able to distinguish these isotopes will do so by inducing microscopic 
interactions that percolate upward to the macroscopic level. In a way this is like the 
situation in quantum mechanics, but with no a priori limitation on the weakness of 
the interaction. 
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